Dataset statistics
| Number of variables | 32 |
|---|---|
| Number of observations | 206593 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 50.4 MiB |
| Average record size in memory | 256.0 B |
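As a quick consistency check on the two memory figures above, the total size can be recomputed from the observation count and the average record size. This is only a sketch using numbers copied from the table; the variable names are illustrative and not part of the report.

```python
MIB = 1024 ** 2  # bytes per mebibyte

n_rows = 206_593        # "Number of observations"
bytes_per_row = 256.0   # "Average record size in memory"

# Total size implied by the per-record average.
total_bytes = n_rows * bytes_per_row
total_mib = total_bytes / MIB

print(f"{total_mib:.1f} MiB")
```

The result agrees with the reported 50.4 MiB. An average of 256 B per record would also be consistent with 8 bytes per value across the 32 variables, although the report itself does not state the column dtypes.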
Variable types
| Numeric | 20 |
|---|---|
| Categorical | 12 |
Warnings
| Warning | Type |
|---|---|
| id has a high cardinality: 206593 distinct values | High cardinality |
| first_browser has a high cardinality: 52 distinct values | High cardinality |
| df_index is highly correlated with year_first_active and 1 other fields | High correlation |
| days_from_first_active_until_booking is highly correlated with days_from_account_created_until_first_booking and 1 other fields | High correlation |
| days_from_account_created_until_first_booking is highly correlated with days_from_first_active_until_booking and 1 other fields | High correlation |
| year_first_active is highly correlated with df_index and 1 other fields | High correlation |
| month_first_active is highly correlated with weekodyear_first_active and 2 other fields | High correlation |
| day_first_active is highly correlated with day_first_created_account | High correlation |
| dayofweek_first_active is highly correlated with dayofweek_first_created_account | High correlation |
| weekodyear_first_active is highly correlated with month_first_active and 2 other fields | High correlation |
| year_first_booking is highly correlated with days_from_first_active_until_booking and 1 other fields | High correlation |
| month_first_booking is highly correlated with weekofyear_first_booking | High correlation |
| weekofyear_first_booking is highly correlated with month_first_booking | High correlation |
| year_first_created_account is highly correlated with df_index and 1 other fields | High correlation |
| month_first_created_account is highly correlated with month_first_active and 2 other fields | High correlation |
| day_first_created_account is highly correlated with day_first_active | High correlation |
| dayofweek_first_created_account is highly correlated with dayofweek_first_active | High correlation |
| weekofyear_first_created_account is highly correlated with month_first_active and 2 other fields | High correlation |
| days_from_first_active_until_account_created is highly skewed (γ1 = 69.29642597) | Skewed |
| id is uniformly distributed | Uniform |
| df_index has unique values | Unique |
| id has unique values | Unique |
| signup_flow has 162557 (78.7%) zeros | Zeros |
| days_from_first_active_until_booking has 20738 (10.0%) zeros | Zeros |
| days_from_first_active_until_account_created has 206421 (99.9%) zeros | Zeros |
| days_from_account_created_until_first_booking has 20741 (10.0%) zeros | Zeros |
| dayofweek_first_active has 31837 (15.4%) zeros | Zeros |
| dayofweek_first_booking has 12407 (6.0%) zeros | Zeros |
| dayofweek_first_created_account has 31830 (15.4%) zeros | Zeros |
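The percentages in the Zeros warnings can be reproduced from the zero counts and the number of observations (206593). The sketch below assumes the share is simply the count divided by the row total, rounded to one decimal; the helper name `zero_share` is hypothetical and not part of pandas-profiling.

```python
n_rows = 206_593  # number of observations in the dataset

# Zero counts copied from a few of the warnings above.
zero_counts = {
    "signup_flow": 162_557,
    "days_from_first_active_until_booking": 20_738,
    "days_from_first_active_until_account_created": 206_421,
    "dayofweek_first_booking": 12_407,
}

def zero_share(count, total=n_rows):
    """Return the percentage of rows equal to zero, rounded to one decimal."""
    return round(100 * count / total, 1)

shares = {name: zero_share(count) for name, count in zero_counts.items()}

for name, share in shares.items():
    print(f"{name}: {share}%")
```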
Reproduction
| Analysis started | 2021-05-18 00:29:33.179006 |
|---|---|
| Analysis finished | 2021-05-18 00:31:04.872108 |
| Duration | 1 minute and 31.69 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
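The reported duration follows directly from the two timestamps. A minimal sketch, assuming both timestamps are in the same time zone and use the format shown in the table:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2021-05-18 00:29:33.179006", FMT)
finished = datetime.strptime("2021-05-18 00:31:04.872108", FMT)

# Elapsed wall-clock time in seconds, then split into minutes and seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)

print(f"{int(minutes)} minute and {seconds:.2f} seconds")
```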
df_index
Real number (ℝ≥0)
| Distinct | 206593 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 108738.5086 |
|---|---|
| Minimum | 0 |
| Maximum | 213450 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 13394.6 |
| Q1 | 56479 |
| median | 108694 |
| Q3 | 161529 |
| 95-th percentile | 203114.4 |
| Maximum | 213450 |
| Range | 213450 |
| Interquartile range (IQR) | 105050 |
Descriptive statistics
| Standard deviation | 60750.85989 |
|---|---|
| Coefficient of variation (CV) | 0.558687632 |
| Kurtosis | -1.187821666 |
| Mean | 108738.5086 |
| Median Absolute Deviation (MAD) | 52523 |
| Skewness | -0.01019791288 |
| Sum | 2.246461471 × 10^10 |
| Variance | 3690666977 |
| Monotonicity | Strictly increasing |
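Several of the statistics above are derived from one another, so they can be cross-checked with a few lines of arithmetic. The sketch below uses the standard definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 − Q1, range = maximum − minimum) with values copied from the tables; it is a sanity check, not the profiler's own code.

```python
import math

# Values copied from the quantile and descriptive statistics tables.
mean = 108738.5086
std = 60750.85989
q1, q3 = 56479, 161529
minimum, maximum = 0, 213450

cv = std / mean               # coefficient of variation
variance = std ** 2           # variance from the standard deviation
iqr = q3 - q1                 # interquartile range
value_range = maximum - minimum

print(f"CV={cv:.9f} variance={variance:.0f} IQR={iqr} range={value_range}")
print(math.isclose(cv, 0.558687632, rel_tol=1e-5))
```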
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 177404 | 1 | < 0.1% |
| 181490 | 1 | < 0.1% |
| 183539 | 1 | < 0.1% |
| 193780 | 1 | < 0.1% |
| 195829 | 1 | < 0.1% |
| 189686 | 1 | < 0.1% |
| 191735 | 1 | < 0.1% |
| 169208 | 1 | < 0.1% |
| 171257 | 1 | < 0.1% |
| Other values (206583) | 206583 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 |
| Value | Count | Frequency (%) |
| 213450 | 1 | |
| 213449 | 1 | |
| 213448 | 1 | |
| 213447 | 1 | |
| 213446 | 1 |
id
Categorical
| Distinct | 206593 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| juobazgx6x | 1 |
|---|---|
| fcmfxdpuvn | 1 |
| ho9ag8jmbi | 1 |
| cwego3scrv | 1 |
| vr1idsa3bo | 1 |
| Other values (206588) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 2065930 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 206593 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | gxn3p5htnn |
|---|---|
| 2nd row | 820tgsjxq7 |
| 3rd row | 4ft3gnwmtx |
| 4th row | bjjt8pjhuk |
| 5th row | 87mebub9p4 |
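The "Distinct categories" figure refers to Unicode general categories. As an illustration, Python's standard `unicodedata` module can classify the characters of the five sample values above; the codes `Ll` and `Nd` correspond to "Lowercase Letter" and "Decimal Number". This only sketches the idea on the sample rows and is not the profiler's implementation.

```python
import unicodedata
from collections import Counter

# The five sample ids shown in the table above.
sample_ids = ["gxn3p5htnn", "820tgsjxq7", "4ft3gnwmtx", "bjjt8pjhuk", "87mebub9p4"]

# Count characters per Unicode general category across all sample values.
category_counts = Counter(
    unicodedata.category(ch) for value in sample_ids for ch in value
)

print(dict(category_counts))
```

On these samples only two categories appear, matching the "Distinct categories: 2" entry for the full column.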
| Value | Count | Frequency (%) |
| juobazgx6x | 1 | < 0.1% |
| fcmfxdpuvn | 1 | < 0.1% |
| ho9ag8jmbi | 1 | < 0.1% |
| cwego3scrv | 1 | < 0.1% |
| vr1idsa3bo | 1 | < 0.1% |
| fsjqo33cmg | 1 | < 0.1% |
| nvs2d8fgwp | 1 | < 0.1% |
| ztsbm4t3hw | 1 | < 0.1% |
| 25j3ljgh1g | 1 | < 0.1% |
| a79o2s27r0 | 1 | < 0.1% |
| Other values (206583) | 206583 |
| Value | Count | Frequency (%) |
| juobazgx6x | 1 | < 0.1% |
| fcmfxdpuvn | 1 | < 0.1% |
| ho9ag8jmbi | 1 | < 0.1% |
| cwego3scrv | 1 | < 0.1% |
| vr1idsa3bo | 1 | < 0.1% |
| fsjqo33cmg | 1 | < 0.1% |
| nvs2d8fgwp | 1 | < 0.1% |
| ztsbm4t3hw | 1 | < 0.1% |
| 25j3ljgh1g | 1 | < 0.1% |
| a79o2s27r0 | 1 | < 0.1% |
| Other values (206583) | 206583 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 57855 | 2.8% |
| h | 57719 | 2.8% |
| y | 57712 | 2.8% |
| o | 57682 | 2.8% |
| f | 57593 | 2.8% |
| 4 | 57586 | 2.8% |
| 1 | 57560 | 2.8% |
| b | 57558 | 2.8% |
| j | 57550 | 2.8% |
| i | 57525 | 2.8% |
| Other values (26) | 1489590 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1492613 | |
| Decimal Number | 573317 | 27.8% |
Most frequent character per category
| Value | Count | Frequency (%) |
| t | 57855 | 3.9% |
| h | 57719 | 3.9% |
| y | 57712 | 3.9% |
| o | 57682 | 3.9% |
| f | 57593 | 3.9% |
| b | 57558 | 3.9% |
| j | 57550 | 3.9% |
| i | 57525 | 3.9% |
| w | 57514 | 3.9% |
| a | 57500 | 3.9% |
| Other values (16) | 916405 |
| Value | Count | Frequency (%) |
| 4 | 57586 | |
| 1 | 57560 | |
| 2 | 57493 | |
| 7 | 57460 | |
| 3 | 57364 | |
| 8 | 57349 | |
| 9 | 57336 | |
| 0 | 57106 | |
| 5 | 57052 | |
| 6 | 57011 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1492613 | |
| Common | 573317 | 27.8% |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 57855 | 3.9% |
| h | 57719 | 3.9% |
| y | 57712 | 3.9% |
| o | 57682 | 3.9% |
| f | 57593 | 3.9% |
| b | 57558 | 3.9% |
| j | 57550 | 3.9% |
| i | 57525 | 3.9% |
| w | 57514 | 3.9% |
| a | 57500 | 3.9% |
| Other values (16) | 916405 |
| Value | Count | Frequency (%) |
| 4 | 57586 | |
| 1 | 57560 | |
| 2 | 57493 | |
| 7 | 57460 | |
| 3 | 57364 | |
| 8 | 57349 | |
| 9 | 57336 | |
| 0 | 57106 | |
| 5 | 57052 | |
| 6 | 57011 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2065930 |
Most frequent character per block
| Value | Count | Frequency (%) |
| t | 57855 | 2.8% |
| h | 57719 | 2.8% |
| y | 57712 | 2.8% |
| o | 57682 | 2.8% |
| f | 57593 | 2.8% |
| 4 | 57586 | 2.8% |
| 1 | 57560 | 2.8% |
| b | 57558 | 2.8% |
| j | 57550 | 2.8% |
| i | 57525 | 2.8% |
| Other values (26) | 1489590 |
gender
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| -unknown- | 91706 |
|---|---|
| FEMALE | 61520 |
| MALE | 53092 |
| OTHER | 275 |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.816382936 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1408217 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | -unknown- |
|---|---|
| 2nd row | MALE |
| 3rd row | FEMALE |
| 4th row | FEMALE |
| 5th row | -unknown- |
| Value | Count | Frequency (%) |
| -unknown- | 91706 | 44.4% |
| FEMALE | 61520 | 29.8% |
| MALE | 53092 | 25.7% |
| OTHER | 275 | 0.1% |
| Value | Count | Frequency (%) |
| unknown | 91706 | |
| female | 61520 | |
| male | 53092 | |
| other | 275 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 275118 | |
| - | 183412 | |
| E | 176407 | |
| M | 114612 | |
| A | 114612 | |
| L | 114612 | |
| u | 91706 | 6.5% |
| k | 91706 | 6.5% |
| o | 91706 | 6.5% |
| w | 91706 | 6.5% |
| Other values (5) | 62620 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 641942 | |
| Uppercase Letter | 582863 | |
| Dash Punctuation | 183412 | 13.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 176407 | |
| M | 114612 | |
| A | 114612 | |
| L | 114612 | |
| F | 61520 | 10.6% |
| O | 275 | < 0.1% |
| T | 275 | < 0.1% |
| H | 275 | < 0.1% |
| R | 275 | < 0.1% |
| Value | Count | Frequency (%) |
| n | 275118 | |
| u | 91706 | 14.3% |
| k | 91706 | 14.3% |
| o | 91706 | 14.3% |
| w | 91706 | 14.3% |
| Value | Count | Frequency (%) |
| - | 183412 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1224805 | |
| Common | 183412 | 13.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| n | 275118 | |
| E | 176407 | |
| M | 114612 | |
| A | 114612 | |
| L | 114612 | |
| u | 91706 | 7.5% |
| k | 91706 | 7.5% |
| o | 91706 | 7.5% |
| w | 91706 | 7.5% |
| F | 61520 | 5.0% |
| Other values (4) | 1100 | 0.1% |
| Value | Count | Frequency (%) |
| - | 183412 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1408217 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 275118 | |
| - | 183412 | |
| E | 176407 | |
| M | 114612 | |
| A | 114612 | |
| L | 114612 | |
| u | 91706 | 6.5% |
| k | 91706 | 6.5% |
| o | 91706 | 6.5% |
| w | 91706 | 6.5% |
| Other values (5) | 62620 | 4.4% |
age
Real number (ℝ≥0)
| Distinct | 99 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.11742411 |
|---|---|
| Minimum | 16 |
| Maximum | 115 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 32 |
| median | 49 |
| Q3 | 49 |
| 95-th percentile | 57 |
| Maximum | 115 |
| Range | 99 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 12.156497 |
|---|---|
| Coefficient of variation (CV) | 0.28863344 |
| Kurtosis | 4.626116599 |
| Mean | 42.11742411 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.001974223 |
| Sum | 8701165 |
| Variance | 147.7804194 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 49 | 85257 | 41.3% |
| 30 | 6039 | 2.9% |
| 31 | 5935 | 2.9% |
| 29 | 5894 | 2.9% |
| 28 | 5862 | 2.8% |
| 32 | 5763 | 2.8% |
| 27 | 5671 | 2.7% |
| 33 | 5455 | 2.6% |
| 26 | 4960 | 2.4% |
| 34 | 4940 | 2.4% |
| Other values (89) | 70817 |
| Value | Count | Frequency (%) |
| 16 | 26 | < 0.1% |
| 17 | 64 | < 0.1% |
| 18 | 665 | |
| 19 | 1097 | |
| 20 | 533 |
| Value | Count | Frequency (%) |
| 115 | 12 | < 0.1% |
| 113 | 4 | < 0.1% |
| 112 | 1 | < 0.1% |
| 111 | 2 | < 0.1% |
| 110 | 188 |
signup_method
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| basic | 147635 |
|---|---|
| facebook | 58412 |
| google | 546 |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 5.850861355 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1208747 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | basic |
| 4th row | |
| 5th row | basic |
| Value | Count | Frequency (%) |
| basic | 147635 | 71.5% |
| facebook | 58412 | 28.3% |
| google | 546 | 0.3% |
| Value | Count | Frequency (%) |
| basic | 147635 | 71.5% |
| facebook | 58412 | 28.3% |
| google | 546 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 206047 | |
| c | 206047 | |
| b | 206047 | |
| s | 147635 | |
| i | 147635 | |
| o | 117916 | |
| e | 58958 | 4.9% |
| f | 58412 | 4.8% |
| k | 58412 | 4.8% |
| g | 1092 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1208747 |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 206047 | |
| c | 206047 | |
| b | 206047 | |
| s | 147635 | |
| i | 147635 | |
| o | 117916 | |
| e | 58958 | 4.9% |
| f | 58412 | 4.8% |
| k | 58412 | 4.8% |
| g | 1092 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1208747 |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 206047 | |
| c | 206047 | |
| b | 206047 | |
| s | 147635 | |
| i | 147635 | |
| o | 117916 | |
| e | 58958 | 4.9% |
| f | 58412 | 4.8% |
| k | 58412 | 4.8% |
| g | 1092 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1208747 |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 206047 | |
| c | 206047 | |
| b | 206047 | |
| s | 147635 | |
| i | 147635 | |
| o | 117916 | |
| e | 58958 | 4.9% |
| f | 58412 | 4.8% |
| k | 58412 | 4.8% |
| g | 1092 | 0.1% |
signup_flow
Real number (ℝ≥0)
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.156946266 |
|---|---|
| Minimum | 0 |
| Maximum | 25 |
| Zeros | 162557 |
| Zeros (%) | 78.7% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 25 |
| Maximum | 25 |
| Range | 25 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 7.550683626 |
|---|---|
| Coefficient of variation (CV) | 2.39176818 |
| Kurtosis | 3.551615681 |
| Mean | 3.156946266 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.283783603 |
| Sum | 652203 |
| Variance | 57.01282322 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 162557 | 78.7% |
| 25 | 13724 | 6.6% |
| 12 | 8897 | 4.3% |
| 3 | 7550 | 3.7% |
| 2 | 5522 | 2.7% |
| 24 | 3975 | 1.9% |
| 23 | 2793 | 1.4% |
| 1 | 837 | 0.4% |
| 6 | 240 | 0.1% |
| 8 | 237 | 0.1% |
| Other values (7) | 261 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 162557 | |
| 1 | 837 | 0.4% |
| 2 | 5522 | 2.7% |
| 3 | 7550 | 3.7% |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 25 | 13724 | |
| 24 | 3975 | 1.9% |
| 23 | 2793 | 1.4% |
| 21 | 195 | 0.1% |
| 20 | 14 | < 0.1% |
language
Categorical
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| en | 199636 |
|---|---|
| zh | 1599 |
| fr | 1146 |
| es | 888 |
| ko | 720 |
| Other values (20) | 2604 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 413186 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en |
| 3rd row | en |
| 4th row | en |
| 5th row | en |
| Value | Count | Frequency (%) |
| en | 199636 | 96.6% |
| zh | 1599 | 0.8% |
| fr | 1146 | 0.6% |
| es | 888 | 0.4% |
| ko | 720 | 0.3% |
| de | 715 | 0.3% |
| it | 489 | 0.2% |
| ru | 378 | 0.2% |
| pt | 234 | 0.1% |
| ja | 224 | 0.1% |
| Other values (15) | 564 | 0.3% |
| Value | Count | Frequency (%) |
| en | 199636 | |
| zh | 1599 | 0.8% |
| fr | 1146 | 0.6% |
| es | 888 | 0.4% |
| ko | 720 | 0.3% |
| de | 715 | 0.3% |
| it | 489 | 0.2% |
| ru | 378 | 0.2% |
| pt | 234 | 0.1% |
| ja | 224 | 0.1% |
| Other values (15) | 564 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 201263 | |
| n | 199760 | |
| h | 1641 | 0.4% |
| z | 1599 | 0.4% |
| r | 1589 | 0.4% |
| f | 1160 | 0.3% |
| s | 1046 | 0.3% |
| t | 809 | 0.2% |
| d | 795 | 0.2% |
| o | 750 | 0.2% |
| Other values (9) | 2774 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 413186 |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 201263 | |
| n | 199760 | |
| h | 1641 | 0.4% |
| z | 1599 | 0.4% |
| r | 1589 | 0.4% |
| f | 1160 | 0.3% |
| s | 1046 | 0.3% |
| t | 809 | 0.2% |
| d | 795 | 0.2% |
| o | 750 | 0.2% |
| Other values (9) | 2774 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 413186 |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 201263 | |
| n | 199760 | |
| h | 1641 | 0.4% |
| z | 1599 | 0.4% |
| r | 1589 | 0.4% |
| f | 1160 | 0.3% |
| s | 1046 | 0.3% |
| t | 809 | 0.2% |
| d | 795 | 0.2% |
| o | 750 | 0.2% |
| Other values (9) | 2774 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 413186 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 201263 | |
| n | 199760 | |
| h | 1641 | 0.4% |
| z | 1599 | 0.4% |
| r | 1589 | 0.4% |
| f | 1160 | 0.3% |
| s | 1046 | 0.3% |
| t | 809 | 0.2% |
| d | 795 | 0.2% |
| o | 750 | 0.2% |
| Other values (9) | 2774 | 0.7% |
affiliate_channel
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| direct | 133678 |
|---|---|
| sem-brand | 25681 |
| sem-non-brand | 17949 |
| seo | 8420 |
| other | 8296 |
| Other values (3) | 12569 |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.7501077 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1394525 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | direct |
|---|---|
| 2nd row | seo |
| 3rd row | direct |
| 4th row | direct |
| 5th row | direct |
| Value | Count | Frequency (%) |
| direct | 133678 | 64.7% |
| sem-brand | 25681 | 12.4% |
| sem-non-brand | 17949 | 8.7% |
| seo | 8420 | 4.1% |
| other | 8296 | 4.0% |
| api | 7736 | 3.7% |
| content | 3780 | 1.8% |
| remarketing | 1053 | 0.5% |
| Value | Count | Frequency (%) |
| direct | 133678 | |
| sem-brand | 25681 | 12.4% |
| sem-non-brand | 17949 | 8.7% |
| seo | 8420 | 4.1% |
| other | 8296 | 4.0% |
| api | 7736 | 3.7% |
| content | 3780 | 1.8% |
| remarketing | 1053 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 199910 | |
| r | 187710 | |
| d | 177308 | |
| t | 150587 | |
| i | 142467 | |
| c | 137458 | |
| n | 88141 | |
| - | 61579 | 4.4% |
| a | 52419 | 3.8% |
| s | 52050 | 3.7% |
| Other values (7) | 144896 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1332946 | |
| Dash Punctuation | 61579 | 4.4% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 199910 | |
| r | 187710 | |
| d | 177308 | |
| t | 150587 | |
| i | 142467 | |
| c | 137458 | |
| n | 88141 | |
| a | 52419 | 3.9% |
| s | 52050 | 3.9% |
| m | 44683 | 3.4% |
| Other values (6) | 100213 |
| Value | Count | Frequency (%) |
| - | 61579 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1332946 | |
| Common | 61579 | 4.4% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 199910 | |
| r | 187710 | |
| d | 177308 | |
| t | 150587 | |
| i | 142467 | |
| c | 137458 | |
| n | 88141 | |
| a | 52419 | 3.9% |
| s | 52050 | 3.9% |
| m | 44683 | 3.4% |
| Other values (6) | 100213 |
| Value | Count | Frequency (%) |
| - | 61579 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1394525 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 199910 | |
| r | 187710 | |
| d | 177308 | |
| t | 150587 | |
| i | 142467 | |
| c | 137458 | |
| n | 88141 | |
| - | 61579 | 4.4% |
| a | 52419 | 3.8% |
| s | 52050 | 3.7% |
| Other values (7) | 144896 |
affiliate_provider
Categorical
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| direct | 133438 |
|---|---|
| google | 50252 |
| other | 11867 |
| craigslist | 2964 |
| bing | 2253 |
| Other values (13) | 5819 |
Length
| Max length | 19 |
|---|---|
| Median length | 6 |
| Mean length | 6.035233527 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1246837 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | direct |
|---|---|
| 2nd row | |
| 3rd row | direct |
| 4th row | direct |
| 5th row | direct |
| Value | Count | Frequency (%) |
| direct | 133438 | 64.6% |
| google | 50252 | 24.3% |
| other | 11867 | 5.7% |
| craigslist | 2964 | 1.4% |
| bing | 2253 | 1.1% |
| facebook | 2196 | 1.1% |
| padmapper | 766 | 0.4% |
| vast | 748 | 0.4% |
| facebook-open-graph | 545 | 0.3% |
| yahoo | 495 | 0.2% |
| Other values (8) | 1069 | 0.5% |
| Value | Count | Frequency (%) |
| direct | 133438 | 64.6% |
| google | 50252 | 24.3% |
| other | 11867 | 5.7% |
| craigslist | 2964 | 1.4% |
| bing | 2253 | 1.1% |
| facebook | 2196 | 1.1% |
| padmapper | 766 | 0.4% |
| vast | 748 | 0.4% |
| facebook-open-graph | 545 | 0.3% |
| yahoo | 495 | 0.2% |
| Other values (8) | 1069 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 200698 | |
| r | 149795 | |
| t | 149527 | |
| i | 141974 | |
| c | 139143 | |
| d | 134251 | |
| o | 119388 | |
| g | 106881 | |
| l | 53379 | 4.3% |
| h | 12907 | 1.0% |
| Other values (14) | 38894 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1245584 | |
| Dash Punctuation | 1253 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 200698 | |
| r | 149795 | |
| t | 149527 | |
| i | 141974 | |
| c | 139143 | |
| d | 134251 | |
| o | 119388 | |
| g | 106881 | |
| l | 53379 | 4.3% |
| h | 12907 | 1.0% |
| Other values (13) | 37641 | 3.0% |
| Value | Count | Frequency (%) |
| - | 1253 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1245584 | |
| Common | 1253 | 0.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 200698 | |
| r | 149795 | |
| t | 149527 | |
| i | 141974 | |
| c | 139143 | |
| d | 134251 | |
| o | 119388 | |
| g | 106881 | |
| l | 53379 | 4.3% |
| h | 12907 | 1.0% |
| Other values (13) | 37641 | 3.0% |
| Value | Count | Frequency (%) |
| - | 1253 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1246837 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 200698 | |
| r | 149795 | |
| t | 149527 | |
| i | 141974 | |
| c | 139143 | |
| d | 134251 | |
| o | 119388 | |
| g | 106881 | |
| l | 53379 | 4.3% |
| h | 12907 | 1.0% |
| Other values (14) | 38894 | 3.1% |
first_affiliate_tracked
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| untracked | 108838 |
|---|---|
| linked | 46084 |
| omg | 43830 |
| tracked-other | 6123 |
| product | 1545 |
| Other values (2) | 173 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.161457552 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1479507 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | untracked |
|---|---|
| 2nd row | untracked |
| 3rd row | untracked |
| 4th row | untracked |
| 5th row | untracked |
| Value | Count | Frequency (%) |
| untracked | 108838 | 52.7% |
| linked | 46084 | 22.3% |
| omg | 43830 | 21.2% |
| tracked-other | 6123 | 3.0% |
| product | 1545 | 0.7% |
| marketing | 139 | 0.1% |
| local ops | 34 | < 0.1% |
| Value | Count | Frequency (%) |
| untracked | 108838 | |
| linked | 46084 | |
| omg | 43830 | |
| tracked-other | 6123 | 3.0% |
| product | 1545 | 0.7% |
| marketing | 139 | 0.1% |
| ops | 34 | < 0.1% |
| local | 34 | < 0.1% |
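The second table above counts words rather than whole values, which is why "local ops" appears as two entries of 34 each while "tracked-other" stays intact. A rule that reproduces these counts is: lowercase each value, split on whitespace, and strip punctuation from the ends of each token. The sketch below applies that rule to the value counts from the first table; the rule is inferred from the report's output and is an assumption, not the profiler's documented behaviour.

```python
import string
from collections import Counter

# Value counts copied from the first table for first_affiliate_tracked.
value_counts = {
    "untracked": 108_838,
    "linked": 46_084,
    "omg": 43_830,
    "tracked-other": 6_123,
    "product": 1_545,
    "marketing": 139,
    "local ops": 34,
}

word_counts = Counter()
for value, count in value_counts.items():
    # Lowercase, split on whitespace, trim punctuation at token edges only.
    for token in value.lower().split():
        word = token.strip(string.punctuation)
        if word:
            word_counts[word] += count

print(word_counts.most_common())
```

Because punctuation is trimmed only at the edges, the inner hyphen of "tracked-other" is kept, while a value such as "-unknown-" would reduce to "unknown".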
Most occurring characters
| Value | Count | Frequency (%) |
| e | 167307 | |
| d | 162590 | |
| k | 161184 | |
| n | 155061 | |
| t | 122768 | |
| r | 122768 | |
| c | 116540 | |
| a | 115134 | |
| u | 110383 | |
| o | 51566 | 3.5% |
| Other values (9) | 194206 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1473350 | |
| Dash Punctuation | 6123 | 0.4% |
| Space Separator | 34 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 167307 | |
| d | 162590 | |
| k | 161184 | |
| n | 155061 | |
| t | 122768 | |
| r | 122768 | |
| c | 116540 | |
| a | 115134 | |
| u | 110383 | |
| o | 51566 | 3.5% |
| Other values (7) | 188049 |
| Value | Count | Frequency (%) |
| - | 6123 |
| Value | Count | Frequency (%) |
| (space) | 34 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1473350 | |
| Common | 6157 | 0.4% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 167307 | |
| d | 162590 | |
| k | 161184 | |
| n | 155061 | |
| t | 122768 | |
| r | 122768 | |
| c | 116540 | |
| a | 115134 | |
| u | 110383 | |
| o | 51566 | 3.5% |
| Other values (7) | 188049 |
| Value | Count | Frequency (%) |
| - | 6123 | 99.4% |
| (space) | 34 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1479507 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 167307 | |
| d | 162590 | |
| k | 161184 | |
| n | 155061 | |
| t | 122768 | |
| r | 122768 | |
| c | 116540 | |
| a | 115134 | |
| u | 110383 | |
| o | 51566 | 3.5% |
| Other values (9) | 194206 |
signup_app
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| Web | 177591 |
|---|---|
| iOS | 17852 |
| Moweb | 5771 |
| Android | 5379 |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 3.160015102 |
| Min length | 3 |
Characters and Unicode
| Total characters | 652837 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Web |
|---|---|
| 2nd row | Web |
| 3rd row | Web |
| 4th row | Web |
| 5th row | Web |
| Value | Count | Frequency (%) |
| Web | 177591 | 86.0% |
| iOS | 17852 | 8.6% |
| Moweb | 5771 | 2.8% |
| Android | 5379 | 2.6% |
| Value | Count | Frequency (%) |
| web | 177591 | |
| ios | 17852 | 8.6% |
| moweb | 5771 | 2.8% |
| android | 5379 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 183362 | |
| b | 183362 | |
| W | 177591 | |
| i | 23231 | 3.6% |
| O | 17852 | 2.7% |
| S | 17852 | 2.7% |
| o | 11150 | 1.7% |
| d | 10758 | 1.6% |
| M | 5771 | 0.9% |
| w | 5771 | 0.9% |
| Other values (3) | 16137 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 428392 | |
| Uppercase Letter | 224445 |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 183362 | |
| b | 183362 | |
| i | 23231 | 5.4% |
| o | 11150 | 2.6% |
| d | 10758 | 2.5% |
| w | 5771 | 1.3% |
| n | 5379 | 1.3% |
| r | 5379 | 1.3% |
| Value | Count | Frequency (%) |
| W | 177591 | |
| O | 17852 | 8.0% |
| S | 17852 | 8.0% |
| M | 5771 | 2.6% |
| A | 5379 | 2.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 652837 |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 183362 | |
| b | 183362 | |
| W | 177591 | |
| i | 23231 | 3.6% |
| O | 17852 | 2.7% |
| S | 17852 | 2.7% |
| o | 11150 | 1.7% |
| d | 10758 | 1.6% |
| M | 5771 | 0.9% |
| w | 5771 | 0.9% |
| Other values (3) | 16137 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 652837 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 183362 | |
| b | 183362 | |
| W | 177591 | |
| i | 23231 | 3.6% |
| O | 17852 | 2.7% |
| S | 17852 | 2.7% |
| o | 11150 | 1.7% |
| d | 10758 | 1.6% |
| M | 5771 | 0.9% |
| w | 5771 | 0.9% |
| Other values (3) | 16137 | 2.5% |
first_device_type
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| Mac Desktop | 89255 |
|---|---|
| Windows Desktop | 72410 |
| iPhone | 20712 |
| iPad | 14281 |
| Other/Unknown | 4591 |
| Other values (4) | 5344 |
Length
| Max length | 18 |
|---|---|
| Median length | 11 |
| Mean length | 11.53261727 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2382558 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mac Desktop |
|---|---|
| 2nd row | Mac Desktop |
| 3rd row | Windows Desktop |
| 4th row | Mac Desktop |
| 5th row | Mac Desktop |
| Value | Count | Frequency (%) |
| Mac Desktop | 89255 | 43.2% |
| Windows Desktop | 72410 | 35.0% |
| iPhone | 20712 | 10.0% |
| iPad | 14281 | 6.9% |
| Other/Unknown | 4591 | 2.2% |
| Android Phone | 2788 | 1.3% |
| Android Tablet | 1285 | 0.6% |
| Desktop (Other) | 1196 | 0.6% |
| SmartPhone (Other) | 75 | < 0.1% |
| Value | Count | Frequency (%) |
| desktop | 162861 | |
| mac | 89255 | |
| windows | 72410 | |
| iphone | 20712 | 5.5% |
| ipad | 14281 | 3.8% |
| other/unknown | 4591 | 1.2% |
| android | 4073 | 1.1% |
| phone | 2788 | 0.7% |
| tablet | 1285 | 0.3% |
| other | 1271 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 267510 | |
| s | 235271 | 9.9% |
| e | 193583 | 8.1% |
| t | 170083 | 7.1% |
| k | 167452 | 7.0% |
| (space) | 167009 | 7.0% |
| D | 162861 | 6.8% |
| p | 162861 | 6.8% |
| n | 113831 | 4.8% |
| i | 111476 | 4.7% |
| Other values (20) | 630621 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1830148 | |
| Uppercase Letter | 378268 | 15.9% |
| Space Separator | 167009 | 7.0% |
| Other Punctuation | 4591 | 0.2% |
| Open Punctuation | 1271 | 0.1% |
| Close Punctuation | 1271 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| o | 267510 | |
| s | 235271 | |
| e | 193583 | |
| t | 170083 | |
| k | 167452 | |
| p | 162861 | |
| n | 113831 | |
| i | 111476 | |
| a | 104896 | 5.7% |
| d | 94837 | 5.2% |
| Other values (7) | 208348 |
| Value | Count | Frequency (%) |
| D | 162861 | |
| M | 89255 | |
| W | 72410 | |
| P | 37856 | 10.0% |
| O | 5862 | 1.5% |
| U | 4591 | 1.2% |
| A | 4073 | 1.1% |
| T | 1285 | 0.3% |
| S | 75 | < 0.1% |
| Value | Count | Frequency (%) |
| (space) | 167009 |
| Value | Count | Frequency (%) |
| / | 4591 |
| Value | Count | Frequency (%) |
| ( | 1271 |
| Value | Count | Frequency (%) |
| ) | 1271 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2208416 | |
| Common | 174142 | 7.3% |
Most frequent character per script
| Value | Count | Frequency (%) |
| o | 267510 | |
| s | 235271 | |
| e | 193583 | 8.8% |
| t | 170083 | 7.7% |
| k | 167452 | 7.6% |
| D | 162861 | 7.4% |
| p | 162861 | 7.4% |
| n | 113831 | 5.2% |
| i | 111476 | 5.0% |
| a | 104896 | 4.7% |
| Other values (16) | 518592 |
| Value | Count | Frequency (%) |
| (space) | 167009 | 95.9% |
| / | 4591 | 2.6% |
| ( | 1271 | 0.7% |
| ) | 1271 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2382558 |
Most frequent character per block
| Value | Count | Frequency (%) |
| o | 267510 | |
| s | 235271 | 9.9% |
| e | 193583 | 8.1% |
| t | 170083 | 7.1% |
| k | 167452 | 7.0% |
| (space) | 167009 | 7.0% |
| D | 162861 | 6.8% |
| p | 162861 | 6.8% |
| n | 113831 | 4.8% |
| i | 111476 | 4.7% |
| Other values (20) | 630621 |
first_browser
Categorical
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| Chrome | 63620 |
|---|---|
| Safari | 44981 |
| Firefox | 33513 |
| -unknown- | 21166 |
| IE | 20970 |
| Other values (47) | 22343 |
Length
| Max length | 20 |
|---|---|
| Median length | 6 |
| Mean length | 6.808502708 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1406589 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 12 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Chrome |
|---|---|
| 2nd row | Chrome |
| 3rd row | IE |
| 4th row | Firefox |
| 5th row | Chrome |
| Value | Count | Frequency (%) |
| Chrome | 63620 | 30.8% |
| Safari | 44981 | 21.8% |
| Firefox | 33513 | 16.2% |
| -unknown- | 21166 | 10.2% |
| IE | 20970 | 10.2% |
| Mobile Safari | 19195 | 9.3% |
| Chrome Mobile | 1258 | 0.6% |
| Android Browser | 844 | 0.4% |
| AOL Explorer | 240 | 0.1% |
| Opera | 187 | 0.1% |
| Other values (42) | 619 | 0.3% |
| Value | Count | Frequency (%) |
| chrome | 64878 | |
| safari | 64176 | |
| firefox | 33543 | |
| unknown | 21166 | 9.3% |
| ie | 21006 | 9.2% |
| mobile | 20521 | 9.0% |
| browser | 907 | 0.4% |
| android | 844 | 0.4% |
| explorer | 273 | 0.1% |
| aol | 240 | 0.1% |
| Other values (48) | 799 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 166259 | |
| o | 142522 | 10.1% |
| a | 128752 | 9.2% |
| e | 120588 | 8.6% |
| i | 119405 | 8.5% |
| f | 97719 | 6.9% |
| m | 65051 | 4.6% |
| h | 65000 | 4.6% |
| C | 64982 | 4.6% |
| n | 64472 | 4.6% |
| Other values (40) | 371839 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1113623 | |
| Uppercase Letter | 228859 | 16.3% |
| Dash Punctuation | 42332 | 3.0% |
| Space Separator | 21760 | 1.5% |
| Other Punctuation | 11 | < 0.1% |
| Decimal Number | 4 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| r | 166259 | |
| o | 142522 | |
| a | 128752 | |
| e | 120588 | |
| i | 119405 | |
| f | 97719 | |
| m | 65051 | 5.8% |
| h | 65000 | 5.8% |
| n | 64472 | 5.8% |
| x | 33881 | 3.0% |
| Other values (14) | 109974 |
| Value | Count | Frequency (%) |
| C | 64982 | |
| S | 64378 | |
| F | 33561 | |
| E | 21281 | 9.3% |
| I | 21037 | 9.2% |
| M | 20657 | 9.0% |
| A | 1125 | 0.5% |
| B | 1039 | 0.5% |
| O | 442 | 0.2% |
| L | 240 | 0.1% |
| Other values (10) | 117 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 2 | 1 | |
| 7 | 1 |
| Value | Count | Frequency (%) |
| - | 42332 |
| Value | Count | Frequency (%) |
| (space) | 21760 |
| Value | Count | Frequency (%) |
| . | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1342482 | |
| Common | 64107 | 4.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| r | 166259 | |
| o | 142522 | |
| a | 128752 | |
| e | 120588 | 9.0% |
| i | 119405 | 8.9% |
| f | 97719 | 7.3% |
| m | 65051 | 4.8% |
| h | 65000 | 4.8% |
| C | 64982 | 4.8% |
| n | 64472 | 4.8% |
| Other values (34) | 307732 |
| Value | Count | Frequency (%) |
| - | 42332 | |
| (space) | 21760 | |
| . | 11 | < 0.1% |
| 0 | 2 | < 0.1% |
| 2 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1406589 |
Most frequent character per block
| Value | Count | Frequency (%) |
| r | 166259 | |
| o | 142522 | 10.1% |
| a | 128752 | 9.2% |
| e | 120588 | 8.6% |
| i | 119405 | 8.5% |
| f | 97719 | 6.9% |
| m | 65051 | 4.6% |
| h | 65000 | 4.6% |
| C | 64982 | 4.6% |
| n | 64472 | 4.6% |
| Other values (40) | 371839 |
country_destination
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| NDF | 119810 |
|---|---|
| US | 60800 |
| other | 9935 |
| FR | 4881 |
| IT | 2776 |
| Other values (7) | 8391 |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 2.724201691 |
| Min length | 2 |
Characters and Unicode
| Total characters | 562801 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NDF |
|---|---|
| 2nd row | NDF |
| 3rd row | US |
| 4th row | other |
| 5th row | US |
| Value | Count | Frequency (%) |
| NDF | 119810 | 58.0% |
| US | 60800 | 29.4% |
| other | 9935 | 4.8% |
| FR | 4881 | 2.4% |
| IT | 2776 | 1.3% |
| GB | 2285 | 1.1% |
| ES | 2203 | 1.1% |
| CA | 1385 | 0.7% |
| DE | 1033 | 0.5% |
| NL | 746 | 0.4% |
| Other values (2) | 739 | 0.4% |
| Value | Count | Frequency (%) |
| ndf | 119810 | 58.0% |
| us | 60800 | 29.4% |
| other | 9935 | 4.8% |
| fr | 4881 | 2.4% |
| it | 2776 | 1.3% |
| gb | 2285 | 1.1% |
| es | 2203 | 1.1% |
| ca | 1385 | 0.7% |
| de | 1033 | 0.5% |
| nl | 746 | 0.4% |
| Other values (2) | 739 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 124691 | |
| D | 120843 | |
| N | 120556 | |
| S | 63003 | |
| U | 61326 | |
| o | 9935 | 1.8% |
| t | 9935 | 1.8% |
| h | 9935 | 1.8% |
| e | 9935 | 1.8% |
| r | 9935 | 1.8% |
| Other values (10) | 22707 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 513126 | |
| Lowercase Letter | 49675 | 8.8% |
Most frequent character per category
| Value | Count | Frequency (%) |
| F | 124691 | |
| D | 120843 | |
| N | 120556 | |
| S | 63003 | |
| U | 61326 | |
| R | 4881 | 1.0% |
| E | 3236 | 0.6% |
| T | 2989 | 0.6% |
| I | 2776 | 0.5% |
| G | 2285 | 0.4% |
| Other values (5) | 6540 | 1.3% |
| Value | Count | Frequency (%) |
| o | 9935 | |
| t | 9935 | |
| h | 9935 | |
| e | 9935 | |
| r | 9935 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 562801 |
Most frequent character per script
| Value | Count | Frequency (%) |
| F | 124691 | |
| D | 120843 | |
| N | 120556 | |
| S | 63003 | |
| U | 61326 | |
| o | 9935 | 1.8% |
| t | 9935 | 1.8% |
| h | 9935 | 1.8% |
| e | 9935 | 1.8% |
| r | 9935 | 1.8% |
| Other values (10) | 22707 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 562801 |
Most frequent character per block
| Value | Count | Frequency (%) |
| F | 124691 | |
| D | 120843 | |
| N | 120556 | |
| S | 63003 | |
| U | 61326 | |
| o | 9935 | 1.8% |
| t | 9935 | 1.8% |
| h | 9935 | 1.8% |
| e | 9935 | 1.8% |
| r | 9935 | 1.8% |
| Other values (10) | 22707 | 4.0% |
days_from_first_active_until_booking
Real number (ℝ≥0)
| Distinct | 1976 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3379.391916 |
|---|---|
| Minimum | 0 |
| Maximum | 7393 |
| Zeros | 20738 |
| Zeros (%) | 10.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 6 |
| median | 5516 |
| Q3 | 5778 |
| 95-th percentile | 6208 |
| Maximum | 7393 |
| Range | 7393 |
| Interquartile range (IQR) | 5772 |
Descriptive statistics
| Standard deviation | 2845.881127 |
|---|---|
| Coefficient of variation (CV) | 0.8421281692 |
| Kurtosis | -1.878777468 |
| Mean | 3379.391916 |
| Median Absolute Deviation (MAD) | 618 |
| Skewness | -0.307731876 |
| Sum | 698158714 |
| Variance | 8099039.388 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 20738 | 10.0% |
| 1 | 14288 | 6.9% |
| 2 | 6307 | 3.1% |
| 3 | 3894 | 1.9% |
| 4 | 2845 | 1.4% |
| 5 | 2193 | 1.1% |
| 6 | 1735 | 0.8% |
| 7 | 1611 | 0.8% |
| 8 | 1275 | 0.6% |
| 9 | 1024 | 0.5% |
| Other values (1966) | 150683 |
| Value | Count | Frequency (%) |
| 0 | 20738 | |
| 1 | 14288 | |
| 2 | 6307 | 3.1% |
| 3 | 3894 | 1.9% |
| 4 | 2845 | 1.4% |
| Value | Count | Frequency (%) |
| 7393 | 1 | |
| 7328 | 1 | |
| 7101 | 2 | |
| 7099 | 1 | |
| 7095 | 2 |
days_from_first_active_until_account_created
Real number (ℝ≥0)
| Distinct | 142 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.236682753 |
|---|---|
| Minimum | 0 |
| Maximum | 1456 |
| Zeros | 206421 |
| Zeros (%) | 99.9% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1456 |
| Range | 1456 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 12.1122496 |
|---|---|
| Coefficient of variation (CV) | 51.17504104 |
| Kurtosis | 5699.565643 |
| Mean | 0.236682753 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 69.29642597 |
| Sum | 48897 |
| Variance | 146.7065904 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 206421 | |
| 1 | 6 | < 0.1% |
| 6 | 4 | < 0.1% |
| 3 | 3 | < 0.1% |
| 29 | 3 | < 0.1% |
| 5 | 3 | < 0.1% |
| 7 | 3 | < 0.1% |
| 2 | 3 | < 0.1% |
| 176 | 2 | < 0.1% |
| 20 | 2 | < 0.1% |
| Other values (132) | 143 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 206421 | |
| 1 | 6 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
| 4 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 1456 | 1 | |
| 1369 | 1 | |
| 1361 | 1 | |
| 1148 | 1 | |
| 1036 | 1 |
days_from_account_created_until_first_booking
Real number (ℝ)
| Distinct | 1965 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3379.155233 |
|---|---|
| Minimum | -349 |
| Maximum | 7101 |
| Zeros | 20741 |
| Zeros (%) | 10.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | -349 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 6 |
| median | 5516 |
| Q3 | 5778 |
| 95-th percentile | 6208 |
| Maximum | 7101 |
| Range | 7450 |
| Interquartile range (IQR) | 5772 |
Descriptive statistics
| Standard deviation | 2845.939191 |
|---|---|
| Coefficient of variation (CV) | 0.8422043366 |
| Kurtosis | -1.8788465 |
| Mean | 3379.155233 |
| Median Absolute Deviation (MAD) | 618 |
| Skewness | -0.3077348916 |
| Sum | 698109817 |
| Variance | 8099369.879 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 20741 | 10.0% |
| 1 | 14289 | 6.9% |
| 2 | 6309 | 3.1% |
| 3 | 3897 | 1.9% |
| 4 | 2845 | 1.4% |
| 5 | 2197 | 1.1% |
| 6 | 1738 | 0.8% |
| 7 | 1611 | 0.8% |
| 8 | 1276 | 0.6% |
| 9 | 1022 | 0.5% |
| Other values (1955) | 150668 |
| Value | Count | Frequency (%) |
| -349 | 1 | |
| -347 | 1 | |
| -338 | 1 | |
| -308 | 1 | |
| -298 | 1 |
| Value | Count | Frequency (%) |
| 7101 | 2 | |
| 7099 | 1 | |
| 7095 | 2 | |
| 7094 | 1 | |
| 7092 | 1 |
year_first_active
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2013.062703 |
|---|---|
| Minimum | 2009 |
| Maximum | 2014 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 2009 |
|---|---|
| 5-th percentile | 2011 |
| Q1 | 2013 |
| median | 2013 |
| Q3 | 2014 |
| 95-th percentile | 2014 |
| Maximum | 2014 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9011508238 |
|---|---|
| Coefficient of variation (CV) | 0.0004476516417 |
| Kurtosis | 0.3051540033 |
| Mean | 2013.062703 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.8084367444 |
| Sum | 415884663 |
| Variance | 0.8120728072 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 2013 | 81841 | 39.6% |
| 2014 | 75496 | 36.5% |
| 2012 | 37950 | 18.4% |
| 2011 | 9331 | 4.5% |
| 2010 | 1970 | 1.0% |
| 2009 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 2009 | 5 | < 0.1% |
| 2010 | 1970 | 1.0% |
| 2011 | 9331 | 4.5% |
| 2012 | 37950 | |
| 2013 | 81841 |
| Value | Count | Frequency (%) |
| 2014 | 75496 | |
| 2013 | 81841 | |
| 2012 | 37950 | |
| 2011 | 9331 | 4.5% |
| 2010 | 1970 | 1.0% |
month_first_active
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.016956044 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.221454147 |
|---|---|
| Coefficient of variation (CV) | 0.5353959915 |
| Kurtosis | -0.9528826524 |
| Mean | 6.016956044 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.2529472005 |
| Sum | 1243061 |
| Variance | 10.37776682 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 27033 | |
| 5 | 25525 | |
| 4 | 21333 | |
| 3 | 19482 | |
| 1 | 16768 | |
| 2 | 15853 | |
| 9 | 14774 | |
| 8 | 14061 | |
| 7 | 13410 | |
| 10 | 13031 | |
| Other values (2) | 25323 |
| Value | Count | Frequency (%) |
| 1 | 16768 | |
| 2 | 15853 | |
| 3 | 19482 | |
| 4 | 21333 | |
| 5 | 25525 |
| Value | Count | Frequency (%) |
| 12 | 12799 | |
| 11 | 12524 | |
| 10 | 13031 | |
| 9 | 14774 | |
| 8 | 14061 |
day_first_active
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.8726772 |
|---|---|
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.74204279 |
|---|---|
| Coefficient of variation (CV) | 0.5507604471 |
| Kurtosis | -1.187475222 |
| Mean | 15.8726772 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.01122201687 |
| Sum | 3279184 |
| Variance | 76.42331214 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 7182 | 3.5% |
| 20 | 7014 | 3.4% |
| 18 | 7010 | 3.4% |
| 16 | 6988 | 3.4% |
| 23 | 6988 | 3.4% |
| 19 | 6956 | 3.4% |
| 28 | 6915 | 3.3% |
| 26 | 6908 | 3.3% |
| 17 | 6908 | 3.3% |
| 13 | 6889 | 3.3% |
| Other values (21) | 136835 |
| Value | Count | Frequency (%) |
| 1 | 5967 | |
| 2 | 6561 | |
| 3 | 6750 | |
| 4 | 6620 | |
| 5 | 6817 |
| Value | Count | Frequency (%) |
| 31 | 3607 | |
| 30 | 6587 | |
| 29 | 6363 | |
| 28 | 6915 | |
| 27 | 6842 |
dayofweek_first_active
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.762053893 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 31837 |
| Zeros (%) | 15.4% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.944268623 |
|---|---|
| Coefficient of variation (CV) | 0.7039213201 |
| Kurtosis | -1.149772853 |
| Mean | 2.762053893 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.1677710956 |
| Sum | 570621 |
| Variance | 3.780180478 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 33988 | |
| 2 | 33041 | |
| 0 | 31837 | |
| 3 | 31504 | |
| 4 | 28807 | |
| 6 | 23731 | |
| 5 | 23685 |
| Value | Count | Frequency (%) |
| 0 | 31837 | |
| 1 | 33988 | |
| 2 | 33041 | |
| 3 | 31504 | |
| 4 | 28807 |
| Value | Count | Frequency (%) |
| 6 | 23731 | |
| 5 | 23685 | |
| 4 | 28807 | |
| 3 | 31504 | |
| 2 | 33041 |
weekodyear_first_active
Real number (ℝ≥0)
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.37388973 |
|---|---|
| Minimum | 1 |
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 13 |
| median | 23 |
| Q3 | 36 |
| 95-th percentile | 49 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 13.95400971 |
|---|---|
| Coefficient of variation (CV) | 0.5724982704 |
| Kurtosis | -0.9411483504 |
| Mean | 24.37388973 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 0.2534181308 |
| Sum | 5035475 |
| Variance | 194.714387 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26 | 6779 | 3.3% |
| 25 | 6426 | 3.1% |
| 24 | 6164 | 3.0% |
| 21 | 6116 | 3.0% |
| 23 | 6062 | 2.9% |
| 20 | 6057 | 2.9% |
| 22 | 5602 | 2.7% |
| 19 | 5490 | 2.7% |
| 18 | 5433 | 2.6% |
| 17 | 5309 | 2.6% |
| Other values (43) | 147155 |
| Value | Count | Frequency (%) |
| 1 | 3196 | |
| 2 | 3824 | |
| 3 | 4026 | |
| 4 | 3785 | |
| 5 | 3794 |
| Value | Count | Frequency (%) |
| 53 | 3 | < 0.1% |
| 52 | 2671 | |
| 51 | 2784 | |
| 50 | 2890 | |
| 49 | 3170 |
year_first_booking
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2022.314619 |
|---|---|
| Minimum | 2010 |
| Maximum | 2029 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 2010 |
|---|---|
| 5-th percentile | 2012 |
| Q1 | 2013 |
| median | 2029 |
| Q3 | 2029 |
| 95-th percentile | 2029 |
| Maximum | 2029 |
| Range | 19 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 7.880815138 |
|---|---|
| Coefficient of variation (CV) | 0.003896928334 |
| Kurtosis | -1.852958414 |
| Mean | 2022.314619 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.3441378278 |
| Sum | 417796044 |
| Variance | 62.10724723 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2029 | 119810 | 58.0% |
| 2014 | 32334 | 15.7% |
| 2013 | 31083 | 15.0% |
| 2012 | 15797 | 7.6% |
| 2011 | 4690 | 2.3% |
| 2015 | 1771 | 0.9% |
| 2010 | 1108 | 0.5% |
| Value | Count | Frequency (%) |
| 2010 | 1108 | 0.5% |
| 2011 | 4690 | 2.3% |
| 2012 | 15797 | |
| 2013 | 31083 | |
| 2014 | 32334 |
| Value | Count | Frequency (%) |
| 2029 | 119810 | |
| 2015 | 1771 | 0.9% |
| 2014 | 32334 | 15.7% |
| 2013 | 31083 | 15.0% |
| 2012 | 15797 | 7.6% |
month_first_booking
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.043592958 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 6 |
| median | 6 |
| Q3 | 6 |
| 95-th percentile | 10 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.059181648 |
|---|---|
| Coefficient of variation (CV) | 0.3407214321 |
| Kurtosis | 1.874039975 |
| Mean | 6.043592958 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.385503438 |
| Sum | 1248564 |
| Variance | 4.240229059 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 130148 | 63.0% |
| 5 | 10322 | 5.0% |
| 4 | 8624 | 4.2% |
| 3 | 8159 | 3.9% |
| 7 | 7073 | 3.4% |
| 8 | 6854 | 3.3% |
| 2 | 6616 | 3.2% |
| 9 | 6402 | 3.1% |
| 1 | 6338 | 3.1% |
| 10 | 6009 | 2.9% |
| Other values (2) | 10048 | 4.9% |
| Value | Count | Frequency (%) |
| 1 | 6338 | |
| 2 | 6616 | |
| 3 | 8159 | |
| 4 | 8624 | |
| 5 | 10322 |
| Value | Count | Frequency (%) |
| 12 | 4944 | |
| 11 | 5104 | |
| 10 | 6009 | |
| 9 | 6402 | |
| 8 | 6854 |
day_first_booking
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.27325708 |
|---|---|
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 15 |
| median | 15 |
| Q3 | 15 |
| 95-th percentile | 27 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5.663609584 |
|---|---|
| Coefficient of variation (CV) | 0.3708187163 |
| Kurtosis | 1.351926905 |
| Mean | 15.27325708 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.2350356536 |
| Sum | 3155348 |
| Variance | 32.07647352 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15 | 122760 | 59.4% |
| 10 | 3009 | 1.5% |
| 17 | 2996 | 1.5% |
| 11 | 2982 | 1.4% |
| 16 | 2964 | 1.4% |
| 13 | 2950 | 1.4% |
| 5 | 2918 | 1.4% |
| 12 | 2892 | 1.4% |
| 8 | 2882 | 1.4% |
| 3 | 2882 | 1.4% |
| Other values (21) | 57358 |
| Value | Count | Frequency (%) |
| 1 | 2690 | |
| 2 | 2807 | |
| 3 | 2882 | |
| 4 | 2784 | |
| 5 | 2918 |
| Value | Count | Frequency (%) |
| 31 | 1526 | |
| 30 | 2650 | |
| 29 | 2557 | |
| 28 | 2809 | |
| 27 | 2698 |
dayofweek_first_booking
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.497320819 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 12407 |
| Zeros (%) | 6.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.374571803 |
|---|---|
| Coefficient of variation (CV) | 0.3930356618 |
| Kurtosis | 0.9424697317 |
| Mean | 3.497320819 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -1.10941395 |
| Sum | 722522 |
| Variance | 1.889447641 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 132782 | 64.3% |
| 2 | 14029 | 6.8% |
| 1 | 13970 | 6.8% |
| 3 | 13627 | 6.6% |
| 0 | 12407 | 6.0% |
| 5 | 10183 | 4.9% |
| 6 | 9595 | 4.6% |
| Value | Count | Frequency (%) |
| 0 | 12407 | 6.0% |
| 1 | 13970 | 6.8% |
| 2 | 14029 | 6.8% |
| 3 | 13627 | 6.6% |
| 4 | 132782 |
| Value | Count | Frequency (%) |
| 6 | 9595 | 4.6% |
| 5 | 10183 | 4.9% |
| 4 | 132782 | |
| 3 | 13627 | 6.6% |
| 2 | 14029 | 6.8% |
weekofyear_first_booking
Real number (ℝ≥0)
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.30690778 |
|---|---|
| Minimum | 1 |
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 24 |
| median | 24 |
| Q3 | 24 |
| 95-th percentile | 44 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 8.923814519 |
|---|---|
| Coefficient of variation (CV) | 0.3671308008 |
| Kurtosis | 1.914570671 |
| Mean | 24.30690778 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.4439373104 |
| Sum | 5021637 |
| Variance | 79.63446557 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 122272 | 59.2% |
| 26 | 2488 | 1.2% |
| 21 | 2451 | 1.2% |
| 20 | 2434 | 1.2% |
| 25 | 2403 | 1.2% |
| 23 | 2366 | 1.1% |
| 18 | 2277 | 1.1% |
| 19 | 2263 | 1.1% |
| 22 | 2218 | 1.1% |
| 15 | 2035 | 1.0% |
| Other values (43) | 63386 |
| Value | Count | Frequency (%) |
| 1 | 1092 | |
| 2 | 1438 | |
| 3 | 1717 | |
| 4 | 1401 | |
| 5 | 1444 |
| Value | Count | Frequency (%) |
| 53 | 1 | < 0.1% |
| 52 | 915 | |
| 51 | 1130 | |
| 50 | 1172 | |
| 49 | 1249 |
year_first_created_account
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 2013 | 81851 |
|---|---|
| 2014 | 75532 |
| 2012 | 37936 |
| 2011 | 9313 |
| 2010 | 1961 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 826372 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2010 |
|---|---|
| 2nd row | 2011 |
| 3rd row | 2010 |
| 4th row | 2011 |
| 5th row | 2010 |
| Value | Count | Frequency (%) |
| 2013 | 81851 | 39.6% |
| 2014 | 75532 | 36.6% |
| 2012 | 37936 | 18.4% |
| 2011 | 9313 | 4.5% |
| 2010 | 1961 | 0.9% |
| Value | Count | Frequency (%) |
| 2013 | 81851 | 39.6% |
| 2014 | 75532 | 36.6% |
| 2012 | 37936 | 18.4% |
| 2011 | 9313 | 4.5% |
| 2010 | 1961 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 244529 | |
| 1 | 215906 | |
| 0 | 208554 | |
| 3 | 81851 | 9.9% |
| 4 | 75532 | 9.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 826372 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 244529 | |
| 1 | 215906 | |
| 0 | 208554 | |
| 3 | 81851 | 9.9% |
| 4 | 75532 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 826372 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 244529 | |
| 1 | 215906 | |
| 0 | 208554 | |
| 3 | 81851 | 9.9% |
| 4 | 75532 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 826372 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 244529 | |
| 1 | 215906 | |
| 0 | 208554 | |
| 3 | 81851 | 9.9% |
| 4 | 75532 | 9.1% |
month_first_created_account
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.016994767 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.221653778 |
|---|---|
| Coefficient of variation (CV) | 0.5354257238 |
| Kurtosis | -0.9530348208 |
| Mean | 6.016994767 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.252885905 |
| Sum | 1243069 |
| Variance | 10.37905307 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 27028 | |
| 5 | 25531 | |
| 4 | 21324 | |
| 3 | 19481 | |
| 1 | 16773 | |
| 2 | 15855 | |
| 9 | 14780 | |
| 8 | 14060 | |
| 7 | 13405 | |
| 10 | 13030 | |
| Other values (2) | 25326 |
| Value | Count | Frequency (%) |
| 1 | 16773 | |
| 2 | 15855 | |
| 3 | 19481 | |
| 4 | 21324 | |
| 5 | 25531 |
| Value | Count | Frequency (%) |
| 12 | 12803 | |
| 11 | 12523 | |
| 10 | 13030 | |
| 9 | 14780 | |
| 8 | 14060 |
day_first_created_account
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.8729531 |
|---|---|
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.742507746 |
|---|---|
| Coefficient of variation (CV) | 0.5507801661 |
| Kurtosis | -1.187560979 |
| Mean | 15.8729531 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.01138779605 |
| Sum | 3279241 |
| Variance | 76.43144168 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 7183 | 3.5% |
| 20 | 7018 | 3.4% |
| 18 | 7009 | 3.4% |
| 16 | 6994 | 3.4% |
| 23 | 6990 | 3.4% |
| 19 | 6955 | 3.4% |
| 28 | 6921 | 3.4% |
| 17 | 6904 | 3.3% |
| 26 | 6904 | 3.3% |
| 13 | 6883 | 3.3% |
| Other values (21) | 136832 |
| Value | Count | Frequency (%) |
| 1 | 5969 | |
| 2 | 6564 | |
| 3 | 6754 | |
| 4 | 6620 | |
| 5 | 6817 |
| Value | Count | Frequency (%) |
| 31 | 3605 | |
| 30 | 6588 | |
| 29 | 6366 | |
| 28 | 6921 | |
| 27 | 6841 |
dayofweek_first_created_account
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.762199106 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 31830 |
| Zeros (%) | 15.4% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.944268962 |
|---|---|
| Coefficient of variation (CV) | 0.7038844367 |
| Kurtosis | -1.1498462 |
| Mean | 2.762199106 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.1677017806 |
| Sum | 570651 |
| Variance | 3.780181796 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 33992 | |
| 2 | 33040 | |
| 0 | 31830 | |
| 3 | 31499 | |
| 4 | 28811 | |
| 6 | 23733 | |
| 5 | 23688 |
| Value | Count | Frequency (%) |
| 0 | 31830 | |
| 1 | 33992 | |
| 2 | 33040 | |
| 3 | 31499 | |
| 4 | 28811 |
| Value | Count | Frequency (%) |
| 6 | 23733 | |
| 5 | 23688 | |
| 4 | 28811 | |
| 3 | 31499 | |
| 2 | 33040 |
weekofyear_first_created_account
Real number (ℝ≥0)
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.37418015 |
|---|---|
| Minimum | 1 |
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 13 |
| median | 23 |
| Q3 | 36 |
| 95-th percentile | 49 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 13.95489097 |
|---|---|
| Coefficient of variation (CV) | 0.5725276042 |
| Kurtosis | -0.9413055266 |
| Mean | 24.37418015 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 0.2533569758 |
| Sum | 5035535 |
| Variance | 194.7389819 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26 | 6777 | 3.3% |
| 25 | 6426 | 3.1% |
| 24 | 6164 | 3.0% |
| 21 | 6116 | 3.0% |
| 23 | 6061 | 2.9% |
| 20 | 6057 | 2.9% |
| 22 | 5606 | 2.7% |
| 19 | 5486 | 2.7% |
| 18 | 5441 | 2.6% |
| 17 | 5308 | 2.6% |
| Other values (43) | 147151 |
| Value | Count | Frequency (%) |
| 1 | 3198 | |
| 2 | 3826 | |
| 3 | 4025 | |
| 4 | 3785 | |
| 5 | 3795 |
| Value | Count | Frequency (%) |
| 53 | 3 | < 0.1% |
| 52 | 2671 | |
| 51 | 2785 | |
| 50 | 2891 | |
| 49 | 3173 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for points lying on a straight line the steepness of the line does not affect r; only the sign of the slope does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
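The calculation described above can be sketched in plain Python. This is a minimal illustration, not the code used to produce this report; the function name `pearson_r` and the sample data are assumptions made for the example.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations.

    The 1/n factors of the covariance and the standard deviations cancel,
    so plain sums of deviations are used.
    """
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when one variable is constant")
    return cov / math.sqrt(var_x * var_y)


x = [1.0, 2.0, 3.0, 4.0, 5.0]
r_up = pearson_r(x, [2 * v + 1 for v in x])        # points on an increasing line
r_down = pearson_r(x, [-3 * v + 10 for v in x])    # points on a decreasing line
# Shifting and rescaling x leaves r unchanged (location/scale invariance).
r_rescaled = pearson_r([10 * v - 7 for v in x], [2 * v + 1 for v in x])
```

Here `r_up` and `r_rescaled` come out as +1 and `r_down` as -1, regardless of how steep each line is.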
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
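As a rough sketch of that recipe, ρ can be obtained by ranking both variables and then applying the Pearson formula to the ranks. The helper names (`average_ranks`, `spearman_rho`) and the choice of average ranks for ties are assumptions of this example, not details taken from the report.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of the positions they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables of xs and ys."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]          # monotonic but not linear
rho = spearman_rho(x, y)         # ranks agree exactly
r_linear = pearson_r(x, y)       # the raw values are not on a line
tie_ranks = average_ranks([10, 20, 20, 30])
```

For this monotonic but nonlinear data, `rho` is 1 while `r_linear` stays below 1, which is the difference the paragraph above describes.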
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
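The pair-counting definition above corresponds to the variant usually called tau-a, and can be sketched as follows. The function name `kendall_tau_a` and the sample data are illustrative assumptions; tie-adjusted variants such as tau-b also exist and may give different values when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs; tied pairs count as neither."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    pairs = list(combinations(range(len(xs)), 2))
    concordant = discordant = 0
    for i, j in pairs:
        # Positive product: both variables order the pair the same way.
        sign = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)


x = [1, 2, 3, 4]
tau_swapped = kendall_tau_a(x, [1, 3, 2, 4])    # one of six pairs is discordant
tau_reversed = kendall_tau_a(x, [4, 3, 2, 1])   # every pair is discordant
```

With one discordant pair out of six, `tau_swapped` is (5 - 1) / 6; the fully reversed ordering gives -1.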
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| df_index | id | gender | age | signup_method | signup_flow | language | affiliate_channel | affiliate_provider | first_affiliate_tracked | signup_app | first_device_type | first_browser | country_destination | days_from_first_active_until_booking | days_from_first_active_until_account_created | days_from_account_created_until_first_booking | year_first_active | month_first_active | day_first_active | dayofweek_first_active | weekodyear_first_active | year_first_booking | month_first_booking | day_first_booking | dayofweek_first_booking | weekofyear_first_booking | year_first_created_account | month_first_created_account | day_first_created_account | dayofweek_first_created_account | weekofyear_first_created_account | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | gxn3p5htnn | -unknown- | 49 | | 0 | en | direct | direct | untracked | Web | Mac Desktop | Chrome | NDF | 7393 | 466 | 6927 | 2009 | 3 | 19 | 3 | 12 | 2029 | 6 | 15 | 4 | 24 | 2010 | 6 | 28 | 0 | 26 |
| 1 | 1 | 820tgsjxq7 | MALE | 38 | | 0 | en | seo | | untracked | Web | Mac Desktop | Chrome | NDF | 7328 | 732 | 6596 | 2009 | 5 | 23 | 5 | 21 | 2029 | 6 | 15 | 4 | 24 | 2011 | 5 | 25 | 2 | 21 |
| 2 | 2 | 4ft3gnwmtx | FEMALE | 56 | basic | 3 | en | direct | direct | untracked | Web | Windows Desktop | IE | US | 419 | 476 | -57 | 2009 | 6 | 9 | 1 | 24 | 2010 | 8 | 2 | 0 | 31 | 2010 | 9 | 28 | 1 | 39 |
| 3 | 3 | bjjt8pjhuk | FEMALE | 42 | | 0 | en | direct | direct | untracked | Web | Mac Desktop | Firefox | other | 1043 | 765 | 278 | 2009 | 10 | 31 | 5 | 44 | 2012 | 9 | 8 | 5 | 36 | 2011 | 12 | 5 | 0 | 49 |
| 4 | 4 | 87mebub9p4 | -unknown- | 41 | basic | 0 | en | direct | direct | untracked | Web | Mac Desktop | Chrome | US | 72 | 280 | -208 | 2009 | 12 | 8 | 1 | 50 | 2010 | 2 | 18 | 3 | 7 | 2010 | 9 | 14 | 1 | 37 |
| 5 | 5 | osr2jwljor | -unknown- | 49 | basic | 0 | en | other | other | omg | Web | Mac Desktop | Chrome | US | 1 | 0 | 1 | 2010 | 1 | 1 | 4 | 53 | 2010 | 1 | 2 | 5 | 53 | 2010 | 1 | 1 | 4 | 53 |
| 6 | 6 | lsw9q7uk0j | FEMALE | 46 | basic | 0 | en | other | craigslist | untracked | Web | Mac Desktop | Safari | US | 3 | 0 | 3 | 2010 | 1 | 2 | 5 | 53 | 2010 | 1 | 5 | 1 | 1 | 2010 | 1 | 2 | 5 | 53 |
| 7 | 7 | 0d01nltbrs | FEMALE | 47 | basic | 0 | en | direct | direct | omg | Web | Mac Desktop | Safari | US | 10 | 0 | 10 | 2010 | 1 | 3 | 6 | 53 | 2010 | 1 | 13 | 2 | 2 | 2010 | 1 | 3 | 6 | 53 |
| 8 | 8 | a1vcnhxeij | FEMALE | 50 | basic | 0 | en | other | craigslist | untracked | Web | Mac Desktop | Safari | US | 206 | 0 | 206 | 2010 | 1 | 4 | 0 | 1 | 2010 | 7 | 29 | 3 | 30 | 2010 | 1 | 4 | 0 | 1 |
| 9 | 9 | 6uh8zyj2gn | -unknown- | 46 | basic | 0 | en | other | craigslist | omg | Web | Mac Desktop | Firefox | US | 0 | 0 | 0 | 2010 | 1 | 4 | 0 | 1 | 2010 | 1 | 4 | 0 | 1 | 2010 | 1 | 4 | 0 | 1 |
Last rows
| | df_index | id | gender | age | signup_method | signup_flow | language | affiliate_channel | affiliate_provider | first_affiliate_tracked | signup_app | first_device_type | first_browser | country_destination | days_from_first_active_until_booking | days_from_first_active_until_account_created | days_from_account_created_until_first_booking | year_first_active | month_first_active | day_first_active | dayofweek_first_active | weekodyear_first_active | year_first_booking | month_first_booking | day_first_booking | dayofweek_first_booking | weekofyear_first_booking | year_first_created_account | month_first_created_account | day_first_created_account | dayofweek_first_created_account | weekofyear_first_created_account |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 206583 | 213441 | omlc9iku7t | FEMALE | 34 | basic | 0 | en | direct | direct | linked | Web | Mac Desktop | Chrome | ES | 44 | 0 | 44 | 2014 | 6 | 30 | 0 | 27 | 2014 | 8 | 13 | 2 | 33 | 2014 | 6 | 30 | 0 | 27 |
| 206584 | 213442 | rf0ay567js | -unknown- | 49 | basic | 0 | en | sem-brand | | omg | Web | Mac Desktop | Chrome | NDF | 5464 | 0 | 5464 | 2014 | 6 | 30 | 0 | 27 | 2029 | 6 | 15 | 4 | 24 | 2014 | 6 | 30 | 0 | 27 |
| 206585 | 213443 | 0k26r3mir0 | FEMALE | 36 | basic | 0 | en | sem-brand | | linked | Web | Mac Desktop | Safari | US | 13 | 0 | 13 | 2014 | 6 | 30 | 0 | 27 | 2014 | 7 | 13 | 6 | 28 | 2014 | 6 | 30 | 0 | 27 |
| 206586 | 213444 | 40o1ivh6cb | -unknown- | 49 | basic | 0 | en | direct | direct | linked | Web | Windows Desktop | Chrome | NDF | 5464 | 0 | 5464 | 2014 | 6 | 30 | 0 | 27 | 2029 | 6 | 15 | 4 | 24 | 2014 | 6 | 30 | 0 | 27 |
| 206587 | 213445 | qbxza0xojf | FEMALE | 23 | basic | 0 | en | sem-brand | | omg | Web | Windows Desktop | IE | US | 2 | 0 | 2 | 2014 | 6 | 30 | 0 | 27 | 2014 | 7 | 2 | 2 | 27 | 2014 | 6 | 30 | 0 | 27 |
| 206588 | 213446 | zxodksqpep | MALE | 32 | basic | 0 | en | sem-brand | | omg | Web | Mac Desktop | Safari | NDF | 5464 | 0 | 5464 | 2014 | 6 | 30 | 0 | 27 | 2029 | 6 | 15 | 4 | 24 | 2014 | 6 | 30 | 0 | 27 |
| 206589 | 213447 | mhewnxesx9 | -unknown- | 49 | basic | 0 | en | direct | direct | linked | Web | Windows Desktop | Chrome | NDF | 5464 | 0 | 5464 | 2014 | 6 | 30 | 0 | 27 | 2029 | 6 | 15 | 4 | 24 | 2014 | 6 | 30 | 0 | 27 |
| 206590 | 213448 | 6o3arsjbb4 | -unknown- | 32 | basic | 0 | en | direct | direct | untracked | Web | Mac Desktop | Firefox | NDF | 5464 | 0 | 5464 | 2014 | 6 | 30 | 0 | 27 | 2029 | 6 | 15 | 4 | 24 | 2014 | 6 | 30 | 0 | 27 |
| 206591 | 213449 | jh95kwisub | -unknown- | 49 | basic | 25 | en | other | other | tracked-other | iOS | iPhone | Mobile Safari | NDF | 5464 | 0 | 5464 | 2014 | 6 | 30 | 0 | 27 | 2029 | 6 | 15 | 4 | 24 | 2014 | 6 | 30 | 0 | 27 |
| 206592 | 213450 | nw9fwlyb5f | -unknown- | 49 | basic | 25 | en | direct | direct | untracked | iOS | iPhone | -unknown- | NDF | 5464 | 0 | 5464 | 2014 | 6 | 30 | 0 | 27 | 2029 | 6 | 15 | 4 | 24 | 2014 | 6 | 30 | 0 | 27 |